Add --metrics CLI flag to filter which metrics run by Lifto · Pull Request #3 · emac-E/lightspeed-evaluation

Lifto · 2026-04-06T16:36:38Z

Summary

Adds a --metrics CLI argument to lightspeed-eval that filters each turn's turn_metrics down to the specified metrics
Example: --metrics custom:answer_correctness runs only answer correctness and skips all RAGAS metrics
Works like --tags and --conv-ids: filtering happens after loading and before validation

Motivation

Running the full metric suite (5 metrics per question) takes ~37 minutes. Comparison runs often need only answer correctness; restricting the run to that one metric with --metrics takes ~6 minutes and requires no edits to the YAML configs.

Changes

runner/evaluation.py: Add a --metrics argument to the argparse parser and pass its value to load_evaluation_data
core/system/validator.py: Add a metrics parameter and filter each turn's turn_metrics list after scope filtering
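
For reviewers who want a quick picture of the two changes, here is a minimal sketch of the idea. It is not the code in this PR: the helper name filter_turn_metrics, the dict-based data layout, and the choice to leave turns without a turn_metrics list untouched are all assumptions made for illustration.

```python
import argparse
from typing import Optional


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser showing how a --metrics flag could be declared."""
    parser = argparse.ArgumentParser(prog="lightspeed-eval")
    parser.add_argument("--eval-data", required=True)
    # nargs="+" accepts one or more space-separated metric names;
    # default=None means "flag not given", i.e. run everything.
    parser.add_argument("--metrics", nargs="+", default=None)
    return parser


def filter_turn_metrics(
    conversations: list[dict], metrics: Optional[list[str]]
) -> list[dict]:
    """Keep only the requested metrics in each turn's turn_metrics list.

    Assumed layout: each conversation is a dict with a "turns" list, and
    each turn may carry a "turn_metrics" list of metric identifiers.
    """
    if not metrics:
        return conversations  # no flag: existing behavior, unchanged
    wanted = set(metrics)
    for conv in conversations:
        for turn in conv.get("turns", []):
            configured = turn.get("turn_metrics")
            if configured is not None:
                # Preserve the configured order, drop everything not requested.
                turn["turn_metrics"] = [m for m in configured if m in wanted]
    return conversations
```

In this sketch the filter only narrows what the YAML already configures: a metric named on the command line but absent from a turn's turn_metrics is not added to that turn.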

Usage

# Run only answer correctness (skip RAGAS metrics)
lightspeed-eval --eval-data config/CLA_tests.yaml --metrics custom:answer_correctness

# Run two specific metrics
lightspeed-eval --eval-data config/CLA_tests.yaml --metrics custom:answer_correctness ragas:faithfulness

# No flag = run all metrics (existing behavior, unchanged)
lightspeed-eval --eval-data config/CLA_tests.yaml

emac-E

This is a good idea, thanks!

emac-E approved these changes Apr 6, 2026

Add --metrics CLI flag to filter which metrics run

f05d54d

Allows running a subset of configured metrics without editing YAML configs. Example: --metrics custom:answer_correctness to skip RAGAS metrics.

Lifto force-pushed the feat/metrics-cli-filter branch from b3da4ad to f05d54d Compare April 6, 2026 17:50

emac-E merged commit 7aceea5 into emac-E:main Apr 6, 2026
5 of 15 checks passed

emac-E pushed a commit that referenced this pull request Apr 10, 2026

Merge pull request #3 from VladimirKadlec/merge-ols-road-evals

72dc212

delete old scripts/evaluation, add README

emac-E added a commit that referenced this pull request Apr 10, 2026

Merge pull request #3 from Lifto/feat/metrics-cli-filter

feff7b9

Add --metrics CLI flag to filter which metrics run
